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Networking genetic regulation and neural computation: Directed network topology 

and its effect on the dynamics 

Andreas Gr6nlund^'|j 
^Department of Physics, Umed University, 901 81 Umed, Sweden 

Two different types of directed networks are investigated, transcriptional regulation networks 
and neural networks. The directed network structure are studied and also shown to reflect the 
different processes taking place on the networks. The distribution of influence, identifled as the the 
number of downstream vertices, are used as a tool for investigating random vertex removal. In the 
transcriptional regulation networks we observe that only a small number of vertices have a large 
influence. The small influences of most vertices limit the effect of a random removal to in most cases 
only a small fraction of vertices in the network. The neural network has a rather different topology 
with respect to the influence, which are large for most vertices. To further investigate the effect of 
vertex removal we simulate the biological processes taking place on the networks. Opposed to the 
presumpted large effect of random vertex removal in the neural network, the high density of edges 
in conjunction with the dynamics used makes the change in the state of the system to be highly 
localized around the removed vertex. 

PACS numbers: 89.75.-k, 89.70.+C, 05.10.-a, 05.65.+b 
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I. INTRODUCTION 

In recent years complex networks have drawn a great 
attention from the physics community. Various measures 
have been introduced in order to capture the function 
and form of specific networks. An observed feature that 
many networks show are a scale free, or at least wide dis- 
tribution of the vertex degree, which is given a popular 
and well cited explanation in [y. Other studies includes 
measures of clustering, assortative mixing Q, between- 
ness centrality yLSIa- For a review of the recent work 
on networks see 0,0, lal- Many of the networks appearing 
in the real world arc directed and naturally the structure 
of two networks can be fundamentally different when the 
direction of the edges are considered, even if the over- 
all structure might be alike when the direction of the 
edges are not considered. Two examples of real world 
networks that are naturally directed are neural networks 
and transcriptional regulation networks. The former is 
the network of neurons where neurons are connected in 
a directed fashion where the axons of each neuron con- 
nects to one of another neuron's dendrites and in this way 
building up a directed network in which signals are sent 
(axons) and received (dendrites) by the individual neu- 
rons. In the transcriptional regulation networks, the ver- 
tices represents proteins and the edges are representing 
one proteins transcriptional regulation (positive and/or 
negative) of another protein. The cause of regulation is 
the attachment of a regulator protein to an operator posi- 
tion located on the DNA upstream of the gene coding for 
the regulated protein or if more than one protein, operon. 
The attachment responses either in an up-regulation or 
down-regulation of the transcription rate by RNAp of 
the specific gene and thus the production of the protein. 
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The reasons for regulation arc many of which one exam- 
ple is energy saving in poor nutritional environment since 
synthesis of RNA and protein both are energy expensive 
processes. Another example is the regulation of enzymes. 
The best studied case of enzyme induction involves the 
enzymes of lactose degradation in Escherichia coli. Only 
in the presence of lactose the enzymes that are neces- 
sary to utilize lactose as a carbon and energy source are 
synthesized. But it is not just a matter of the presence 
of lactose. If both glucose and lactose is present E. coli 
chooses glucose. This is transcriptionally regulated via 
both positive and negative control. 

The networks used in this paper are the neural network 
of the nematode Caenorhabditis elegance (NNCE) |3] , the 
transcriptional regulaton network of the bacteria E. coli 
(TREC) [13 and the transcriptional regulaton network 
of yeast, Saccharomyces cerevisiae (TRSC) jTj. 



II. STRUCTURAL PROPERTIES 

A directed edge is in the literature formally termed 
arc, which also will be the term used in this paper. Be- 
cause of the direction of the arcs one is able to follow 
directed paths in the network, representing the flow of 
information, the chain of command, or some other flow 
in the network. Depending on the system "living on" the 
network, the structure might look very different when 
the direction of the arcs in the different networks are 
taken into account. To get a first picture of what is go- 
ing on in the networks we look at the distribution of the 
number of vertices with just outgoing arcs, only incom- 
ing arcs, and with both outgoing and incoming arcs. In 
a network in which information is flowing like the neu- 
ral network a significant fraction of vertices should have 
both incoming and outgoing arcs in order to transport 
information between different parts of the network. Fig- 
ure n shows the distribution of the three different types 



of vertices in the networks. The neural network con- 
sists of mostly interneurons, that is neurons with both 
incoming and outgoing arcs, and have a low fraction of 
sensory neurons (only outgoing arcs) and motor neurons 
(only incoming arcs). For more information on interneu- 
rons, sensory neurons and motor neurons see |l2l |. The 
regulation networks have a different structure where the 
number of vertices of both incoming and outgoing arcs 
are suppressed and the network is dominated by vertices 
of only incoming links. 
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FIG. 1: The distribution of vertices with only incoming arcs, 
only outgoing arcs, and with both incoming and outgoing 
arcs. 




FIG. 2; The cumulative degree distribution of vertices with; 
only incoming arcs, only outgoing arcs, and with both incom- 
ing and outgoing arcs. 



Since the different neurons play a different role and 
have different functions, a natural question to ask would 
be if this is reflected in the degree of the different ver- 
tices. In the neural network the sensory neurons receive 
their signals not from other neurons but from receptors. 
The motor neurons transmits signals not to other neurons 
but to one or more effectors igniting chemical reactions 
like the ones responsible for the contraction of muscles. 
The sensory neurons collect information from the outside 
world which is passed on via the interneurons to various 
parts of the network. This defines the state of the system 
which is visible via a response in the motor neurons. The 
"end-station" of an input is not necessarily a specific mo- 
tor neuron. The inputs are collectively setting the whole 
network in different states, and thus produces different 
responses to different inputs. 

In figure 121 the degree distribution of the different net- 
works are plotted and one can observe that the transcrip- 
tional regulation networks (TRSC and TREC) are some- 
what similar in the sense that the degree of the vertices 
with only incoming arcs have a lower degree than the rest 
of the vertices. This indicates that the proteins with no 
control and with a position in analogy of a laborer tend 
to be controlled by a few proteins and often just one pro- 
tein which often has a high degree |l3l | and with mostly 
just outgoing arcs, a global controller. In the neural net- 
work the situation is precisely the opposite, the sensors 
have in general very few outgoing arcs, in fact often just 
one. The sensory neurons are in most cases connected 
to an interneuron of a relative high degree of incoming 
links from which it collects information from a number of 
sensory neurons. In only a few cases the sensory neurons 
are connected directly to a motor neuron. 



III. ROBUSTNESS VIA STRUCTURE 

Many networks are believed to have a modular struc- 
ture with functional modules in which communication is 
more present than between the modules. In addition to 
just having separate functions this also minimizes the in- 
fluence of a random change of the network. The modular- 
ity has been studied and detected in undirected networks 
jlil VVaL llfi| and network models [l3| ■ In nature there are 
many things found or believed to be modular [l^ [l^l 
where the separate modules are responsible for different 
functions and together serve as a unit in a larger system. 
The modularity of the transcriptional regulation network 
of S. cerevisiae (TRSC) has been studied [23|, and also 
the robustness in [2l|. Some modules are more impor- 
tant than others, and by removing a unit more or less of 
the function of the total system is removed. Since the 
transcriptional regulation networks serves as regulating 
systems of the production of various proteins with differ- 
ent tasks they need to be constructed to remain most of 
the functions even if subjected to random removal or ran- 
dom changes of proteins. Random changes are naturally 
present via mutations in the DNA. Besides the fact that 
the DNA contains "garbage" which reduces the proba- 
bility of removing important functions, one could ask if 
the transcriptional regulation networks have evolved to a 
structure which is robust to mutations and if it is possible 
to reveal and quantify the robustness with some measure 
of the structure? 

In the literature there are a number of different mea- 
sures of prestige and influence, see eg. |22|. Let Di be 
the number of downstream vertices of a vertex Vi, and 
define the influence li of the vertex Vi to be the fraction 
of vertices in the network which is downstream of vertex 
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The distribution of the influence P{I) of the vertices 
in the network gives information of how the influence 
and control are distributed in the network. Moreover, 
the distribution P(/) also gives information of how large 
fraction of the network that maximally (and typically) 
can be affected by a random change. For the function 
of the network to be stable to changes, the structure has 
to be designed in a way where most vertices only influ- 
ence a low fraction of the vertices in the network. But 
even though the stability is important, the function of the 
network might anyway need some vertices of great influ- 
ence or control, global controller proteins. The global 
controllers are needed for the response to nutritional el- 
ements C, O, N, P, heat shock, growth rate, and more. 
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FIG. 3; The distribution of influence P{I) of TREC and 
TRSC. The data is binned. 



In figure the distribution P(/) of TRSC and TREC 
is plotted. From the plot one can see that in TREC and 
TRSC that most of the proteins control only a very little 
part of the network. The implication of this is that a 
random mutation or deletion of a vertex (protein) affects 
only a very small part of the network. In the case of 
NNCE, the situation is the reverse as can be seen in table 
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I 


P{I) 


ASf 


0.0 


0.075 


0.0 


0.007 


0.014 


0.0030(0) 


0.939 


0.854 


0.0052(6) 


0.942 


0.043 


0.0035(8) 


0.946 


0.014 


0.0051(7) 



TABLE I: The influence /, the fraction of neurons of influence 
P{I), and the effect ASf of removal of a neuron of influence 
I. 



A random removal or damage of a neuron can possi- 
bly affect the whole network. How and to which extent 



probably depends on the exact dynamics of the network 
and the situation. The network is still likely to be con- 
nected after a random vertex removal because of the high 
density of arcs, however because of the high influence of 
the vertices a random removal of a neuron will possibly 
change the state of a large fraction of the other neurons, 
and thus the response to different inputs/stimuli. This 
is analyzed in the next section. 



IV. ROBUSTNESS VIA DYNAMICS 

To analyze whether the influence / of a vertex is of 
importance when considering vertex removal, two simple 
models are used, where one captures the nature of the 
interactions of the trancriptional networks and the other 
the neural networks. In [23J, the trancriptional network 
of S. cerevisiae (TRSC) is analyzed in terms of boolean 
network models with the aim of determining feasible rule 
structures. In their paper they flnd that many of the 
generated networks are shown to have a substantial part 
which is frozen in the sense that the final state is the 
same regardless of the initial states. As described be- 
fore the vertices in the trancriptional networks consists 
of proteins and the arcs represents one protein's regula- 
tion of another, in which the regulation can be either a 
positive regulation, an activator protein, or a negative 
regulation, a repressor. Also a study of the robustness of 
transcriptional regulation networks with the use of neural 
networks are done in |24l |. 

In the model that we use to simulate the transcrip- 
tional regulation, the state of a gene coding for a specific 
protein Vi has two values, expressed or not expressed; on 
or off. If the state of the gene coding for a specific pro- 
tein is Ojff there is no production, or at most a very small 
production, of the protein and is therefore not consid- 
ered to be present in the system. If the state is on there 
is a production enough for the protein considered to be 
present in the system. The state of a gene coding for a 
protein Wj is determined and regulated by the proteins Vj 
with arcs pointing towards Vi. The proteins with no in- 
coming arcs are determined from the initiation and can 
be considered as different environmental settings. The 
rules for how the update is done can be summarized as: 

• All vertices are randomly initiated with the value 
on or off. 

• The vertices Vi are then updated sequentially with 
the following rule until a final state is achieved: 

— The state of all vertices Vj pointing at Vi are 
determined. 

— If the state of a protein Vj with negative reg- 
ulation (repressor) is on, the state of protein 
Vi is off. 

— If no negative regulation is present, the regu- 
lation follows a majority rule and the state of 
the protein Vi is on/off if the majority of the 



state of the positive regulating proteins Vj are 
on/off. 

The update is illustrated in figure 01 The motivation for 
the model follows from the nature of the interactions. 
Negative regulation by a repressor blocks the produc- 
tion of a protein by binding to an operator downstream 
of the promoter of the gene that codes to the specific 
protein. When the repressor sits downstream of the pro- 
moter it stops the transcription of the downstream gene 
from RNAp. Positive regulation by an activator enhances 
the probability of RNAp to attach to the promoter of the 
gene and thereby the transcription of the gene that codes 
for the specific protein. 
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FIG. 4: The update rules for the protein regulation net- 
works. The positively regulating proteins, activator proteins, 
are treated with a majority rule illustrated in a) and b) and 
is override by a negative regulation, repressor, which is illus- 
trated in c) and d). 



The state of the total system is represented by a vec- 
tor of dimension N, (5'i,5'2, ...,Sn)- To investigate the 
effect of a random vertex deletion and to which extent the 
influence of the vertex plays a role for the state of the sys- 
tem, the final state obtained from an initial configuration 
is compared with the corresponding final state from the 
same initial configuration but with a vertex deleted from 
the network. The relationship of the influence of the re- 
moved vertex and the effect of the influence of the final 
state is demonstrated in figure |31 ASf is the fraction of 
vertices (proteins) in the network having a different fi- 
nal state after the removal of a vertex v. Only the initial 
configurations that converge to a final state is considered. 
The fraction of initial states that converge to a final state 
is approximately 1 for TREC and 0.7 for TRSC. The ini- 



tial configurations that do not converge to a final state 
ends up in an oscillatory state, and they are not con- 
sidered or investigated here. As one can see the overall 
behavior is that the difference in the final state ASf has 
a somewhat linear behavior of the influence / of the ver- 
tex V being removed. The plot of TRSC do not follow the 
straight line approximation for larger values of /, which 
might indicate that the measure of influence used here is 
not perfectly suited for the applied dynamics. 
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FIG. 5: The difference of the final state ASf versus vertex 
removal of a vertex with influence I. The lines are straight 
line approximations to guide the eye and the plotted data is 
binned. The errors are smaller than the symbol sizes. 



Since the networks have a fix structure, the values for 
the difference in the final states ASf for the different in- 
fluences / does not all follow the approximated straight 
line as can be observed, but the calculated error for the 
individual values of ASf are nevertheless small. Since 
the dynamics incorporates a majority rule, the effect of 
a vertex removal decays with the distance from the re- 
moved vertex and therefore only a fraction of the down- 
stream vertices get a different state after the removal. 
How large fraction depends therefore on the structure of 
the network and the typical distance to the downstream 
vertices from the removed one. 

Figure El shows the fraction of all vertices with dis- 
tance d from the removed vertex which have a different 
final state ASf.d compared with the final state before 
the removal. Except from the exponential decay, one 
can also observe that the longest directed path is only 
of four steps in TREC and of six in TRSC. As a com- 
parison the diameter in TREC is 13 and in TRSC 14. 
Since the fraction of vertices with a different state drops 
exponentially with distance a refinement of the measure 
of influence used here would be the measure of proximity 
prestige (see |22j). which is a vertex's number of down- 
stream vertices normalized with their average distance to 
the vertex. 

NNCE is applied to a similar dynamics as the tran- 
scriptional regulation networks, with the only difference 
that there are no negative regulations which overrule 
the positive ones. The neurons are thus treated as 
McCullough-Pitts neurons |2^ |2g with binary states 
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FIG. 6: The fraction of vertices with a different state AS/.d 
plotted against distance of the removed vertex. The data is 
fitted to exponential functions. The errors are smaller than 
the symbol sizes. 



FIG. 7: The fraction of vertices with a different state ASf.d 
plotted against distance of the removed vertex. The data is 
fitted to an exponential function. 



on/off, and with equal and positive synaptic coupling 
strength (excitatory) and with a threshold of ni/0.5, 
where rii are the number of inputs. All neurons arc con- 
sidered to be excitatory, that is in a state ojf when the 
input is below the threshold and on if the input is above 
the threshold. The update of the state of each neuron is 
therefore simply a majority rule, that is the state Si of a 
neuron Vi is on/off if the majority of the state of the in- 
coming signals are on/off. If there are no majority, that 
is the number of on inputs are equal to the number of off 
inputs the state of the neuron defined to be on. Since the 
influence of the neurons in the network are concentrated 
to a value around / = 0.94 a linear dependence of the 
difference in the state ASf to the influence / is thus not 
achievable. However, we can still get information of the 
effect of a vertex removal just by looking at the average 
difference in the final state from a deletion of a neuron. 
As before the results are averaged by a number of differ- 
ent initial configurations and different vertex removals. 

The results of the simulations done for NNCE are sum- 
marized in table Q] As one can see, the change in the 
state of the system is very small even if the influence of 
the removed vertex is large. The fact that the network is 
very dense and that the dynamics follow a majority rule 
implies that the change in the state of the system from 
a deletion of an individual neuron is small. The change 
in the state from the removal of a single neuron is sim- 
ply averaged out in most cases, but there are of course 
changes in the state of some neurons located nearby the 
removed neuron. A further study would be to see how 
the state of the system responds when deleting a neigh- 
borhood of neurons to resemble a more realistic physical 
damage. One of the conclusions one can make is that 
even if the influence of most vertices are large, the dy- 
namics put on the network results in a situation where 
the network is not very affected by a random removal of 
a single neuron. Figure |7| shows the decay of the fraction 
of changed states with the distance, and like the tran- 
scriptional regulation networks the decay fits well to an 
exponential decay. 



V. SUMMARY 

The two directed types of networks analyzed here are 
shown to have a different structure in various measures 
which incorporates the direction of the edges. The neu- 
ral network of C. elegance consists of mostly vertices with 
both incoming and outgoing arcs, interneurons, possibly 
due to the fact that it is an information network in which 
information is processed and spread between different 
parts of the network. There is also a fraction of ver- 
tices with only outgoing arcs, sensory neurons, that feed 
the network with external information. Finally, there is a 
small fraction of vertices with only incoming arcs, motor 
neurons, responsible for igniting chemical reactions like 
contraction of muscles. 

We find that the protein regulation networks of E. coli 
and S. cerevisae both show a small fraction of proteins 
with both incoming and outgoing arcs, and the dominat- 
ing part of the network consists of proteins that only have 
incoming arcs, which we term laborers in the analogy of 
a human laborer which only has a small influence in the 
system he or she works in. The laborers are possibly used 
as building blocks or as components in different biochem- 
ical processes. There are also proteins that have a large 
number of outgoing arcs and thus globally controls the 
production of many proteins, where most of them are 
laborers. 

The influence of the vertices, deflned for a vertex as the 
fraction of vertices in the network situated downstream 
of the vertex, are limited for most vertices in the tran- 
scriptional regulation networks. The neural network is 
showing a substantial part of the network to influence 
almost all vertices in the network. We simulate the bio- 
logical processes on the networks and we investigate re- 
lationship of the influence of a removed vertex with the 
change in the state of the biological system. In the pro- 
tein regulation networks the effect of a random removal 
are limited in most cases due to the fact that the influ- 
ence of most proteins are restricted to a small number 
of proteins. However a removal of a protein of great in- 
fluence changes the state of the system more since the 



change in the state is shown to increase, with some ex- 
ceptions, Hnearly with the influence. In the case of the 
neural network where the influence of most vertices are 
fairly large, the great number of arcs suppresses the ef- 
fect of a removal of an individual neuron since the state 
of each neuron obeys a majority rule of the states of the 
incoming signals. Since the changes in the states of the 
vertices due to a vertex removal is shown to decay expo- 
nentially with the distance from the removed vertex, a 
proper refinement of the measure of influence could be to 
include the typical distance to the downstream vertices 



in the measure of influence, just like the measure of prox- 
imity prestige. A remark with the previous observations 
in mind would therefore be that the influence or pres- 
tige in a network (directed or not) probably to a large 
extent depends on the dynamics applied to the network, 
and therefore every investigation of prestige or influence 
should be in conjunction with the dynamics applied to 
the network. 

Many thanks to Petter Holme and Martin Rosvall for 
your helpful comments and beneflcial discussions con- 
tributing to this paper. 
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